Trip Purpose Imputation Using GPS Trajectories with Machine Learning
نویسندگان
چکیده
We studied trip purpose imputation using data mining and machine learning techniques based on a dataset of GPS-based trajectories gathered in Switzerland. With large number labeled activities eight categories, we explored location information hierarchical clustering achieved classification accuracy 86.7% random forest approach as baseline. The contribution this study is summarized below. Firstly, from GPS exclusively without personal shows negligible decrease (0.9%), which indicates the good performance our steps wide applicability scheme case limited availability. Secondly, dependence model geographical location, participants, duration survey investigated to provide reference when comparing accuracy. Furthermore, show ensemble filter be an excellent tool research field not only because increased (93.6%), especially for minority classes, but also reduced uncertainties blindly trusting labeling by vulnerable class noise due response burden. Finally, derivation across participants reaches 74.8%, significant suggests possibility effectively applying trained small subset citizens larger trajectory sample.
منابع مشابه
Exploring trip fuel consumption by machine learning from GPS and CAN bus data
This study aims to explore the trip fuel consumption from a large-scale dataset. To better understand how the multiple variables (e.g., average travel speed, trip distance) influence the trip fuel consumption, we propose the support vector machine (SVM) to learn the relationship between the trip fuel consumption and the corresponding factors. A large-scale GPS and CAN (Controller Area Network) ...
متن کاملImputation of Missing Data Using Machine Learning Techniques
A serious problem in mining industrial data bases is that they are often incomplete, and a significant amount of data is missing, or erroneously entered. This paper explores the use of machine-learning based alternatives to standard statistical data completion (data imputation) methods, for dealing with missing data. We have approached the data completion problem using two well-known machine le...
متن کاملUsing Classifier-Based Nominal Imputation to Improve Machine Learning
Many learning algorithms perform poorly when the training data are incomplete. One standard approach involves first imputing the missing values, then giving the completed data to the learning algorithm. However, this is especially problematic when the features are nominal. This work presents “classifier-based nominal imputation” (CNI), an easy-to-implement and effective nominal imputation techn...
متن کاملJ3.9 Multiple Imputation through Machine Learning Algorithms
A problem common to meteorological and climatological datasets is how to address missing data. The majority of multivariate analysis techniques require that all variables be represented for each observation; hence, some action is required in the presence of missing data. In cases where the individual observations are thought not important, deletion of every observation missing one or more piece...
متن کاملA machine learning framework for TCP round-trip time estimation
In this paper, we explore a novel approach to end-to-end round-trip time (RTT) estimation using a machine-learning technique known as the experts framework. In our proposal, each of several ‘experts’ guesses a fixed value. The weighted average of these guesses estimates the RTT, with the weights updated after every RTT measurement based on the difference between the estimated and actual RTT. Th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ISPRS international journal of geo-information
سال: 2021
ISSN: ['2220-9964']
DOI: https://doi.org/10.3390/ijgi10110775